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ABSTRACT 

A web content hosting service provider needs to dynami- 
ccdly allocate servers in a server farm to its customers' web 
sites. Ideally, the allocation to a site should always suffice 
to handle its load. However, due to a limited number of 
servers and the overhead incurred in changing the alloca- 
tion of a server from one site to another, the system may 
become overloaded. The problem faced by the web hosting 
service provider is how to allocate the available servers in 
the most profitable way. Adding to the complexity of this 
problem is the fact that future loads of the sites axe either 
unknown or known only for the very near future. 

In this paper we model this server aliocation problem, and 
consider both its offline and online versions. We give a poly- 
nomial time algorithm for computing the optimal offline al- 
location. In the online setting, we show almost optimal al- 
gorithms (both deterministic and randomized) for any pos- 
itive lookahead. The quality of the solution improves as the 
lookahead increases. We also consider several special cases 
of practical interest. Finally, we present some experimental 
results using actual trace data that show that one of our 
online algorithm performs very close to optimal. 

Interestingly, the online server allocation problem can be 
cast as a more general benefit task system that we define. 
Our results extend to this task system, which captures also 
the benefit maximization variants of the A:-server problem 
and the metrical task system problem. It follows that the 
benefit maximization variants of these problems are more 
tractable than their cost minimization variants. 
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1. INTRODUCTION 

Web content hosting is an important emerging market. 
Data centers and web server "farms" are proliferating (see, 
e.g., [19, 20]). The rationale for using such centers is that 
service providers can benefit from economies of scale and 
sharing of resources among multiple customers- This bene- 
fit translates, in turn, to lower cost of maintenance for the 
customers who purchase these hosting services. Web con- 
tent hosting services are structured in many ways. One of 
the most prevailing ways is outsourcing: the customers de- 
liver their web site content to the web content hosting service 
provider, and the service provider is responsible for serving 
this content in response to HTTP requests. Most of the ser- 
vice providers use "farms" of commodity servers to achieve 
this goal. 

One of the components in the payment for such a ser- 
vice is "pay per served request". Thus, one of the main 
objectives of the service provider is to maximize the revenue 
from served requests while keeping a tab on the amount of 
resources used. Idezdiy, the allocation to a web site should 
always suffice to serve its requests. However, due to a lim- 
ited number of servers and the overhead incurred in chang- 
ing the allocation of a server from one site to another, the 
system may become overloaded, and requests may be left 
unserved. We make the realistic assumption that requests 
are not queued, and a request is lost if it is not served at 
the time it is received. The problem faced by the web host- 
ing service provider is how to utilize the available servers in 
the most profitable way. Adding to the complexity of this 
problem is the fact that future loads of the sites are either 
unknown or known only for the very neau* future. In this 
paper we model this problem, and consider both its ofHine 
and online versions. 

In our model time is divided into intervals of fixed length. 
We assume that each site's demand is uniformly spread 
throughout each such interval. Server allocations remain 
fixed for the duration of an interval. We assume that servers 
axe reallocated only at the beginning of an interval, and that 
a reallocated server is unavailable for the length of the inter- 
val during which it is reallocated. This represents the time 
to "scrub" the old site (customer data) to which the server 
was allocated, to reboot the server and to load the new site 
to which the server has been allocated. The length of the 
time interval is set to be equal to the nonnegligible amount 
of time required for a server to prepare to serve a new cus- 
tomer. In current technology, this time is in the order of 5 
to 10 minutes. In the sequel, we normalize time so that the 
length of a time interval is one unit. 
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Table 1: Results 



Each server has a rate of requests it can serve in a time 
interval. For simplicity, we assume that all rates are identi- 
cal. Due to practical concerns (security constraints placed 
by customers), we do not allow sharing of servers at the same 
time. Customers share servers only in the sense of using the 
same servers at different times, but do not use the same 
servers at the same time. Thus, even in case of overload, 
some of the servers may be underutilized if they are allo- 
cated to sites with rates of requests lower than the servers' 
rate. In the sequel, we normalize rates so that the servers* 
rate is 1, and allow the demand of sites to be fractional. 

We assume that each customer's demand is associated 
with a benefit gained by the service provider in case a unit 
demand is satisfied. Given the number of available servers, 
the objective of the service provider is to find a time- varying 
server allocation that maximizes the benefit gained by satis- 
fying sites' demand. We call this server allocation problem 
the web' server farm problem. In the offline version of this 
problein, future demands of the sites are known. In the 
fully online version, only the demand of the current interval 
is known at the beginning of the interval. It makes sense to 
assume that some amount of future denLand is known to the 
service provider, and we model it by lookahead. Lookahead 
L means that the demand of the sites is known for the next 
L time intervals (following the current interval). In reality, 
lookahead will require that some forecasting mechanism be 
used to predict future demands. Fortunately, web traffic ex- 
hibits properties of predictability over the short term which 
render this practical as demonstrated in [13, 18]. We mea- 
sure the performance of an online allocation algorithm using 
its competitive ratio; that is, the maximum over all instances 
of the ratio of the benefit gained by the optimal offline allo- 
cation to the benefit gained by the onhne allocation on the 
same instance. Since we are dealing with a maximization 
problem, this ratio is always greater than 1, and the goal is 
to make it as small as possible. 

Interestingly, our model can be cast as a more general 
benefit task system. In this task system we are given a set 
of states for each time t and a benefit function. The system 
can be in a single state at each time, and the benefit gained 
between times t and t + 1 is a function of the system states at 
times t and ^-f- 1. The goal is to find a time- varying sequence 
of states that maximizes the total benefit gained; that is, 
at each time t we need to determine to which state should 
the system move (and this will be the state of the system at 
time t+l), and we gain the benefit that is determined by the 
benefit function. Similar to the web server farm problem, in 
the offline version, the benefit function is known in advance- 
In the fully online version, only the possible benefits of the 
current step are known at the time a move is determined, 
and in case of lookahead L, the benefits of the next L steps 
(in addition to the current) are known. Note that number 
of states in a benefit task system might be super-polynomial 
if the states are not given explicitly. 



It can be shown that benefit task systems capture also 
benefit maximization variants of well studied problems, such 
as the fc-server problem [16, 9] and metrical task systems [10, 
9] (see Section 2). Thus, our results hold for these variants 
as well, and show that the benefit variants of these problems 
may be more tractable than their cost minimization variants. 

Our results 

We observe that the offline version of the web server farm 
problem can be solved in polynomial time. In the online 
setting, we first note that the competitive ratio in the fully 
online version is unbounded; we then consider the case of 
a positive lookahead for both deterministic algorithms and 
randomized ones. We obtain tight bounds on the competi- 
tive ratio for the randomized case and almost tight bounds 
for the deterministic case. In the deterministic case we also 
consider the special case L = 1. The specific results are 
summarized in Table 1. Our bounds on the competitive 
ratio extend to the benefit task system problem, assuming 
that an optimal sequence of moves for the period of the 
lookahead (i.e. for the next L -I- 1 time steps) is efficiently 
computable. Therefore, our upper bounds (i.e. algorithms) 
are presented in the more general framework of the benefit 
task system problem, and the lower bounds are given for the 
more restricted web server farm problem. 

In addition, we consider two special cases of the web server 
farm problem with lookahead 1. One is the basic case of one 
server and two sites. For the competitive ratio in this case we 
obtain a lower bound of 3^3/2 « 2.598 and an upper bound 
of IH- 0 « 2.618, where <l> = is the golden ratio. The 

second case is that of k servers and arbitrary number of sites, 
for which we give an algorithm that achieves a competitive 
ratio 2 if A; is even and 2 + 2/(2A: - 1) if it is odd. 

Several of our algorithms use the natural approach of "dis- 
counting the future". Specifically, when the algorithm faces 
the choice between a guar2Lnteed benefit immediately and 
a potential benefit in the future, the decision is made by 
comparing the guaranteed benefit value with a discounted 
value of the potential future benefit. This discount factor is 
exponential in the number of time units that it would take 
the potential benefit to be materialized. The reason for dis- 
counting future benefits is straightforward. By the time a 
benefit will be materialized, things might change and the al- 
gorithm might decide to make another choice for a potential 
(even larger) benefit. 

Another approach used by our algorithms in the case of 
lookahead is an "intermittent reset". Some number of future 
steps are determined and then a single time step is "wasted" 
to reallocate the servers in preparation for another sequence 
of future steps. The time step for reallocation is chosen so 
that the amount of benefit unrealized due to the reallocation 
is guaranteed to be small with respect to the benefit that is 
gained. 
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We use actual web traffic data of commercial sites and 
randomly generated data to evaluate our algorithms. Our 
experiments show that the discount algorithm performs bet- 
ter, with less lookahead, than the intermittent reset algo- 
rithm, and is within 0.2% of optimal on all the commercial 
trace instances, even with lookahead one. 

Related work 

Our web server farm problem can be viewed as a scheduling 

problem. The requests are represented by unit length jobs, 
and the k servers are machines. Each job has a deadline 
which is exactly one time unit aifter its release, and a value. 
The goal is to maximize the throughput; that is, to maximize 
the value of the scheduled jobs. An additional factor is setup: 
it takes one time unit to set up a machine to handle a job 
for one web site if it handled a job in the previous time unit 
for a different site. 

Several papers have considered the problem of throughput 
maximization in scheduling jobs on parallel machines with 
release times and deadlines, both in the offline and online 
settings [5, 3, 7, 8, 14]. In the standard notation, this type of 
problem is denoted by P\ri\ Wi{l - Ui), The type of setup 
considered here is related to "batch setup" scheduling, for 
which several offline problems have been studied (see [17] for 
a survey). The online problems considered here also relate 
to previous work on online call admission and bandwidth 
allocation (12, 15, 4, 6]. Again, the issue of setups does not 
appear in- any of these papers. 

The benefit task system can also be viewed as a profit 
maximization version of the request-answer game of [9]. To 
the best of our knowledge this is the first attempt to con- 
sider a task system for profit maximization rather than cost 
minimization problems. Awerbuch et al. [2] considered a 
similar problem in which a decision maker that has no way 
to accurately predict future performance of several options 
has to choose one option that will have the best future per- 
formance. This problem has an unbounded deterministic 
competitive ratio, and the authors propose a randomized 
algorithm. Lookahead is not considered in [2]. 

Organization, In the next section we give more accurate 
definitions of our models and show their relations to the 
/^-server model arid metrical task systems. In Section 3 we 
describe an offline algorithm for the server farm problem. In 
Section 4 we devise two deterministic online algorithms for 
the benefit task system, based on discounting the future and 
on an intermittent reset, and we give upper bounds on their 
competitive ratio. In Section 5 we prove lower bounds on the 
competitive ratio of any deterministic online algorithm for 
the more restricted web server farm problem. In Section 6 
we consider some special cases of the online deterministic 
web server farm problem. In Section 7 we give tight bounds 
for the randomized case. We conclude in Section 8 with 
some experimental results based on traces of commercial 
retail sites. 

2. THE PROBLEIVIS 

The vireb server farm problem: Suppose that we are 
given 5 web sites that are to be served by k web servers. (For 
simplicity, we assume that all servers are identical.) Time 
is divided into units (a.k.a. steps). It is assumed that the 
demand of a web site is uniform in each time unit. Each 



server has a "service rate" which is the number of requests 
to a web site each server can serve in a time unit. Without 
loss of generality, we normalize the demands by the service 
rate so that a server can serve unit demand per time unit 
and demands of a site may be fractional, A web server can 
be allocated to no more than one site at each time unit and 
it takes a time unit to change the allocation of a server. 

A problem instance consists of the number of servers ik, 
the number of sites s, a nonnegative benefit matrix {6t,t} 
denoting the benefit gained by serving a request of site % G 
[l.,5] for time step t € [l. r], and a nonnegative demand 
matrix {<fi,t} denoting the number of requests at site t for 
time step t. The goal is to find for each site i e [I..5], a 
time- varying allocation {ai,f} of servers, so as to maximize 
the total benefit, as follows. The allocation must satisfy that 
for each t, X^f^^ ai,t < k. Only a-,t = min{ai,i_i , a.,* } of the 
servers allocated to site i for time step t are "productive" , 
i.e., actually serve requests. We get that the total benefit 
of an allocation {ai,t} is Yl^^^ Ei=i • min{di,t, a^, J = 
Z)e=i 5^i=i ^».* ' ^^^{<^i.t,0'i,t,a-i,t-'i}' The initisJ allocation 
is fixed to be, say, at,o = 0 for all i. 

In the offline version of this problem we are given the 
complete demand matrix {dt.t} and we need to compute 
the complete allocation {at.t}. In the fully online version 
we need to compute the allocation {at,t} at time t given 
the demands so far, i.e., {di^t'} for all i and t' < t. It is 
not difficult to show that cmy fully online algorithm may 
perform arbitrarily badly with respect to the ofliine optimal 
solution. This is true because of the one time unit lag in 
a server's availability. Thus we consider online algorithms 
with lookahead. In the online version with lookahead L, 
at each time t we are additionally given the demands and 
benefits for times i+l, . . . , £H-L, i.e., the entries di^t' and 6^,^/ 
for i e [I..5] and 6 [i + l..< + L], and we need to compute 
the allocation at time t, i.e., Ot.t for i 6 [I..5]. 

The benefit task system problem: The web server 
farm problem is a special case of the generalized task sys- 
tem benefit problem. In this problem we are given (i) a set 
of possible states Ut, for each time t € [0..r], and (ii) a non- 
negative benefit function B whose domain is \J^{Ut x t/^+i), 
that denotes, for each time the benefit that is accrued (at 
time £ + 1) by the transition from a state in Ut to a state in 
Ut+i. The goal is to choose a state st for each time * so as 
to maximize the total benefit Y^^^ B(5t,5t+i). The initial 
state is fixed by letting that Uo = {so}. 

In the offline version of the problem all the state sets and 
the benefit function are known in advance. In the online 
version with lookahead L, the state st € Ut should be com- 
puted based on knowing only Ut^ for < t + L, and the 
restriction of the function B to pairs of these sets of states. 

Observe that the web server farm problem can be cast in 
this setting by identifying each possible allocation of servers 
to sites at time t with a state Si,t, and defining the benefit 
function B(Si,ty Sj^t+i) to be the benefit gained by (serving 
some number of requests while) changing the allocation at 
time t from the one represented by 5i,t to the sillocation at 
time t+1 represented by 5;,f+i. (In a sense, the set of states 
for all times are the same.) The number of states is exponen- 
tial in the number of servers fc, so the states and the benefit 
function are imphcit and follow from the more succinct rep- 
resentation of the web server farm problem. For example, 
the values B(5i.£, 5,,t+i) are not listed explicitly, and any 
single value can be efficiently computed when necessary. 



542 

BEST AVAILABLE COPY 



Benefit maximization variants of well known problems can 
also be cast in this setting of a benefit task system. Consider 
the benefit version of the A;-server problem. This version of 
the Ar-server is similar to the classical A:-server problem [16] 
with one difference. Instead of minimizing the cost of satis- 
fying all requests, we define for each time t and any possible 
pair of server configurations, one at time t -1 and one at 
time t, a net benefit gained by satisfying the request starting 
from the configuration at time t — 1 and ending at the con- 
figuration at time t, (Note that the configuration at time 
t must include at least one server at the point of the re- 
quest.) This benefit has to be nonnegative and is composed 
of a fixed positive component that is reduced by the cost to 
move from the configuration at time * — 1 to the configu- 
ration at time t. The goal in this case is to maximize the 
total benefit. It is not difficult to see that this problem can 
also be modeled by the benefit task system defined above 
and thus all our results apply to the benefit version of the 
A:-server problem. 

Similarly, consider the benefit version of a metrical task 
system. This version is similar to the classical metrical task 
system [10], with the difference that each task is associated 
with a vector of benefits, one for each state, such that the net 
benefit after subtracting the transition cost is nonnegative. 
This model as well can be cast as a benefit task system and 
our results apply. 

3. OFFLINE ALGORITHIVI FOR THE WEB 
SERVER FARjVf PROBLEM 

In this section we show that the offline web server farm 
problem can be solved in polynomial-time. Note that an 
algorithm that solves this offline problem is also a compo- 
nent in the deterministic online algorithms of Section 4, as 
these online algorithms rely on an abihty to find an optimal 
sequence of eillocations in the limited future given by the 
lookahead (i.e., the next L -f 1 time units). 

We reduce the web server farm problem to the well-known 
minimum-cost network flow problem, which can be solved in 
polynomial time; see for example [1, 11]. We remark that 
when 5* is polynomial in the input size, the web server farm 
problem can be solved in polynomial time also using dy- 
namic programming! However, in general k might be expo- 
nential in the input si^. 

Theorem 1. The offline web server farm problem can be 
reduced in polynomial time to a minimum-cost network flow 
problem. HencBj it can be solved in polynomial time. 

Proof. Recall that the input for the minimum cost net- 
work flow problem is a directed network, two special vertices 
called the source vertex and the sink vertex, an amount of 
flow to be injected into the source vertex, and a nonnegative 
capacity and a cost for each edge. The goal is to find from 
all the flows from s to t that respect the edge capacities and 
are of size fc, one that has a minimal cost, where the cost of 
a flow is the sum, over all edges, of the product of the flow 
on this edge and its cost. 

In fact, we describe below a reduction from the web server 
farm problem to the analogous maximum-cost network flow 
problem. The latter problem is essentially the same as the 
minimum-cost network flow problem (and thus Ccm be solved 
in polynomial time), since the edge costs are not restricted 



in sign, i.e., are allowed to be negative. We remark that in 
our network, all the paths fi*om 5 to ^ are of equal length 
and therefore another way to guarantee that all costs are 
nonnegative is to increase the costs of all the edges by the 
same sufficiently large number. 

The outline of the reduction is as follows. Given an in- 
stance of the web server farm problem, we construct a (di- 
rected) network with s **rails" , one per site. Each rail is a 
chain of edges, each representing one time step. (Actually, 
we will split these edges into three parallel edges, for reasons 
which will become clear shortly.) Flow along a rail repre- 
sents the allocation of servers to the corresponding site. In 
addition, we construct a set of "free pool" nodes, one per 
time step, through which flow will pass when servers are 
reallocated from one site to another. 

Let {rfi.t} with i e [I..5] and t 6 [l..r] be the demand 
matrix- We construct nodes Ui^t for i € [I..5] and t G [l..r], 
along with nodes ft for i € [l..r]. For each site s and each 
time step we construct three edges from ni,t-i to n,-,t. 
The first has capacity [rf<,tj and cost 6i,t. The second has 
capacity one and cost bi,t • {di^t -[di,t\). The last has infinite 
capacity and cost 0. Flow along the first edge represents the 
benefit of edlocating servers to site i during time step t, up 
to the integer part of di^t. Flow along the second represents 
the remaining benefit, bi^t times the fractional part of dt,t, to 
be collected by one more server (that will be only partially 
productive). Flow along the third represents the fact that 
extra servers can remain allocated to i, but do not collect 
any benefit. Clearly the first edge will saturate before flow is 
present on the second, and similarly the second will saturate 
before flow is present on the third. 

We also construct edges of infinite capacity and cost 0 
from n,,t-i to ft and from ft to ni,t, for each t 6 [l..r] and 
i 6 [I..5]. These represent the movement of servers from one 
site to another. (We use an intermediate node ft to keep the 
mmiber of edges linear.) 

Finally, we construct a source into which we inject flow 
k, with infinite capacity zero cost edges to each n,,!, and a 
sink with infinite capacity zero cost edges from each n»,r. 

It is not hard to see that an integral flow of cost C in 
this network corresponds to an allocation {ai,t} with bene- 
fit equal to C, and vice versa. It is well known that since 
all the edge capacities aire integral, there is minimum-cost 
(or maximum-cost) flow in the network that is integral, and 
that, furthermore, such a flow can be efficiently found. This 
implies an efficient algorithm that finds an optimal alloca- 
tion {a<,t}. Note that the size of the network (number of 
nodes and edges) is linear in the size of the demand ma- 
trix {di,t}y and thus the offline web server farm problem can 
be solved within roughly the the same time as the best al- 
gorithm for minimum-cost network flow on a network with 
0{s • r) nodes and edges. □ 

4. DETERMINISTIC ONLINE ALGORITHJVIS 
FOR BENEFIT TASK SYSTE]VfS 

In this section we describe two deterministic algorithms 
for benefit task systems with lookahead L, The first al- 
gorithm is a future discounting algorit hm that achieves a 
competitive ratio of (1 -f- 1/L) kJL + 1. Asymptotically in 
L this ratio is 1 -h 0{\ogL)/L. The second algorithm is an 
intermittent reset algorithm that achieves a competitive ra- 
tio of 1+ 4/ (L — 7), which is better than the first algorithm 
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when L is sufficiently large. Each algorithm requires that 
one can efficiently find a sequence of moves that is optimal 
(in a certain sense) for the period of the lookahead (i.e. for 
the next L + 1 time steps). In the particular case of the 
web server farm problem, we can achieve this requirement 
by using the offline algorithm of Section 3. 

4*1 The future discounting algorithm 

We present a determini stic al gorithm that achieves com- 
petitive ratio (1 -f l/L) y/L + 1 for the benefit task system 
problem. The algorithm is based on discounting future ben- 
efits in an exponential way, as follows. Consider a sequence 
of L-f 1 moves that collects the benefits 6o, 6i , . . . , (in this 
order) in the next L+l time steps. We define the anticipated 
benefit of this sequence to be 6o -h 6i /a -h . . . 4- 6l / a^ (for a 
p2urameter a > 1 that we later choose to be a = y/L + 1). 

The discounting algorithm greedily follows at each time 
step the largest anticipated benefit possible at that time. 
Namely, at every time step the algorithm finds a sequence 
of L -{- 1 moves (for the next L -\- \ time steps) whose an- 
ticipated benefit is maximal, and performs the first move in 
this sequence. We stress that a new sequence is calculated 
at every time step, and that the sequence found in the cur- 
rent time step does not have to agree with the one found in 
the previous time step. 

Note that this discounting algorithm requires efficient com- 
putation of a sequence of L + 1 moves with the maximum 
anticipated benefit. The following lemma provides this re- 
quirement when the number of states at every time t is poly- 
nomial in the input size, e.g. when the states of the benefit 
task system are given explicitly. In the web server farm prob- 
lem, the number of (implicit) states is super-polynomial, but 
we can use the offline algorithm of Section 3, with straight- 
forward modifications for handling the initial state and the 
discount. 

Lemma 2. Suppose that the number of states possible for 
any one time is at most M, Then a sequence of La-I moves 
with the largest anticipated benefit can be computed in time 
that is polynomial in' M (and L). 

Proof. Use dynamic programming that goes iteratively 
over the time steps and accumulates discounted benefits. As 
the additional benefit of the next time step depends only on 
the current state, it suffices to have for each time step t a 
table of size Af , and compute the table for time step ^ + 1 
from that of time step t. □ 

Theorem 3. The competitive rati o of th e above discount- 
ing algorithm is at most (1 -h \/L) \/L + 1. 

Proof. Consider among the sequences of L -f- 1 moves 
that are available for the online algorithm at time the se- 
quence with the largest anticipated benefit, and let 6o, 6i , . . . , 6i: 
be the benefits collected by this sequence. Define ONt = bo 
and OA^;A = 6i/q -h . . . -h fti/a^, and then ONt -h 07V;/i 
is the largest anticipated benefit over all sequences of L + 1 
moves that are available to the online algorithm at time 
t. The online algorithm performs the first move in this se- 
quence, and thus the benefit it collects at time * is ONt. 

Fixing an offline algorithm arbitrarily, let OFFt denote 
the benefit that this offline algorithm collects at time t, and 
let OFFt^ be a shorthand for OFFt+i/a-h. . ,+OFFt+L/oi^ . 



One sequence of moves that the online algorithm considers 
at time t is to first join the offline algorithm, (i.e., move 
to the sanie state as the offline algorithm at time £), and 
then follow the offline algorithm for the next L steps. The 
anticipated benefit of this sequence is at least OFFt* Since 
the online algorithm follows at time t the sequence of moves 
with the largest anticipated benefit, we get that 

ONt + ON:fi > OFFt^^ (1) 

Another sequence of moves that is considered by the on- 
line algorithm at time t is to follow the sequence that had 
the largest anticipated benefit at the previous time step t — l 
(without the first step of that sequence, which was per- 
formed at time t — 1, and with an arbitrary move added 
at the end of the sequence.) The contribution of these ben- 
efits to the anticipated benefit at time t is larger by a factor 
of Q than their contribution at time < — 1, and so the antic- 
ipated benefit of this sequence of moves is at least aON^^. 
Since the online algorithm follows at time t the sequence of 
moves with the largest anticipated benefit, we get that 

ONt + ON:ii > aON;^ (2) 

Adding (1 - ^) times inequality (1) and ^ times inequal- 
ity (2), we get that^ 

ONt + Oiv;^i > on;'' -i- (i - \)OFFt\, . (s) 

Adding up inequality (3) over all time steps t we get that 
(it is straightforward that we can assume, without loss of 
generality, that all the benefits in the first and last L -h 1 
time steps are zero regardless of the state): 

t t 

The last equality follows since every benefit that the offline 
collects occurs in Ylt ^^^t+i with each of the discounts 
1/a, . . . , We conclude tha t the competitive ratio of 

the algorithm is (choosing a = \/L 4- 1): 

EtOFFt ^ 1 g^-^' /l_L1y^^^/r-^T 

□ 

4.2 The intermittent reset algorithm 

We present a deterministic intermittent reset online algo- 
rithm that has competitive ratio 1 + 0{1/L) for the benefit 
task system problem. This algorithm is motivated by the 
raindomized algorithm presented in Section 7 that sacrifices 
one out of every L + 1 steps in order to gain the optimal ben- 
efit between sacrificed steps. Here, we sacrifice one out of 
every S{L) steps, but the choice of steps to sacrifice is done 
carefully in a deterministic fashion and the overall benefit 
that we gain is close to the expected benefit of the random- 
ized algorithm. 

The algorithm works in iterations of length at least L/2 
and at most L. Let 5, denote the start time of iteration L 
Each iteration i consists of four steps, as follows, 

"^In a sense, this is a potential function analysis. See for 
example [9]. 
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Step l: Consider all times < = 5i + Ljl + 1 . . . , 5i -I- L. For 
each such time i, let xt denote the maximum benefit 
of any transition from a state at time it to a state at 
time 1. Let T be the time t that minimizes xt^ i.e., 
XT = min{a:t : 5i + L/2 + 1 < * < 5i + L}. 

Step 2. Compute the optimal sequence of transitions that 
starts at any state at time 5< -h 1 and ends at any 
state at time T. This can be done either by dynamic 
programming similar to Lemma 2 or by applying the 
offline algorithm of Section 3. (In the first iteration 
the initial state is given sind thus we may compute the 
optimal sequence of transitions from the initial state to 
any state at time T. The next step has to be modified 
accordingly.) 

Step 3: Move from the current state at time Si to the state 
at time 5^ + 1 that is the starting state of the opti- 
mal sequence computed above, and continue with the 
. optimal sequence of moves to the state at time T. 

Step 4: Set the starting point -T for the next itera- 
tion. 

Theorem 4. The competitive ratio of the above intermit- 
tent reset algorithm is at most 1 -h - 

Proof. We assume that X is a multiple of 4, and oth- 
erwise round it downwards to the nearest multiple of 4 (at 
the expense of increasing the competitive ratio as explained 
later). We will call the starting times of the iterations 
"gaps," Denote the gaps by 5i , 52, • . We divide the tran- 
sitions taken by the algorithm into two components, the 
moves taken in the gaps and the steps taken in the rest of 
the times. Consider the path taken by the online algorithm. 
It may not collect any benefit in the gaps, but since it com- 
putes the optimal paths between gaps (allowing arbitrary 
initial and terminal states adjacent to the gaps), clearly the 
benefit of the optimal offline between gaps is no larger than 
the benefit of the onHne algorithm between gaps. 

Now we bound the benefit of the optimal ofRine algorithm 
in the gaps. Consider iteration i starting at Si and comput- 
ing T = Si+i. Consider the portion of the online path from 
Si + L/2.-h 1 to Denote the length of this portion by 

a, i.e., a = 5,-+! - (si -f L/2 -f- 1). Consider the portion 
of the path computed in the next iteration from 5^+1 -hi 
to Si -I- L -f 1. Denote the length of this portion by 6, i.e., 
6 = 5i H- L -h 1 - (5i+i -hi). By our choice of L, L/2 is even 
and a -h 6 = L/2 - 1. Thus one of a and b is even and the 
other is odd. - Assume a is odd and 6 is even; the proof is 
similar in the opposite case. (We note that ideally, we would 
like both a and b to be even, but choosing a -h 6 to be even 
may lead to a worse case when both are odd.) 

We claim that the benefit achieved by the online algorithm 
in these portions of the path is at least a;r • (a + 6 - l)/2. 
This is true since in every time step in the range there is a 
transition of benefit xt and thus the online algorithm can 
achieve benefit at least xt on every other transition, even 
if it collects no benefit on the other half of the transitions. 
Since a is odd, it can collect at least xt ■ (a - 1)/2 in the first 
portion, and since 6 is even, it can collect at least xr ♦ 6/2 
in the second. 

Putting these together along with a + 6 = X/2 — 1 we have 
that the online benefit is at least 

. .a--1^6. L-4 



Since Si -h L -h 1 is always less than 5^+1 + L/2 -h 1, these 
portions are disjoint among iterations. Summing over all 
iterations we have ON - (j^) > ^.xr,, where ON de- 
notes the total benefit collected by the online algorithm 
and XTi denotes the maximum benefit possible in the ith 
gap. Accounting for the portions outside the gaps we have 
ON > OFF - J2i^Tn where OFF denotes the benefit of 
the optimal ofliine algorithm. Putting these together and 
allowing for values of L that are not multiples of 4 yields 
the desired bound of 1 + j;^. □ 

5. LOWER BOUNDS FOR DETERMINIS- 
TIC ONLINE ALGORITHMS 

In this section we prove three lower bounds on the com- 
petitive ratio of deterministic online algorithms for the web 
server farm problem with one server. The first lower bound 
is for general lookahead, the second is for lookahead 1 (and 
arbitrary number of sites), and the third is for lookahead 1 
and two sites. 

5.1 A lower bound for general lookahead 

We show a lower bound of 1 -h 2; + 7-2^ on the competitive 
ratio of a deterministic online algorithm for the web server 
farm problem with one server. The same lower bound then 
follows for the benefit task system problem. Formally, we 
have the following theorem. 

Theorem 5. The competitive ratio of any deterministic 
online algorithm for the web server farm problem with one 
server (and hence for the benefit task system problem) is 
larger than 1 -f r + Z^- 

Proof. Consider a deterministic onhne algorithm whose 
input is a 3x (L+3) demand table {d^.t }. Since the algorithm 
has lookahead L, it is given at time ^ = 0 only the first LH- 1 
columns, as illustrated below. Since the first column is all 
zeros, we may assume without loss of generality that the 
server is initially allocated to the first site. We denote the 
sequence of locations of the server by triangles, and use a 
parameter a that will be determined later. 



t>0 1 a 
0 1 a 
0 1 a 



a ? ? 
a ? ? 
a ? ? 



Since at time ^ = 0 all three rows look the same, we may 
assume without loss of generality that the algorithm decides 
to allocate the server at this time to the first site, and it 
collects a benefit of 0. Next, at time t = 1 another column 
is revealed to the algorithm, and the situation is as follows. 



>0 >1 a 
0 1 a 

0 1 a 



o 0 ? 
a a 1 

a a 1 



At time t = \ the algorithm can either allocate the server 
to the first site again or to another site. Consider first the 
case that the algorithm allocates the server to the first site 
again, so the algorithm collects a benefit of 1. In this case 
the last column is revealed (at time « = 2) to be all zeros, 
and the situation is the following. 



{dut} = 



0 
0 



t>l 
1 
1 



>a 
a 
a 



0 
a 
a 
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The total benefit that the online can collect in this case is at 
most 1 + (L — l)a, while the offline could have collected the 
whole second row, i.e., 1 + La, Therefore, the competitive 
ratio of the algorithm would be at least 



1 + (L - l)a 



= 1 + 



l + (L-l)a' 



Consider now the case that the algorithm allocates the 
server at time ^ = 2 to a site other than the first one, so it 
collects a benefit of 0. Without loss of generality we may 
assume that the server is allocated to the second site. At 
time t = 2, the last column is revealed to the algorithm, and 
the situation is the following. 



>0 >1 a 
0 1 t>o 
Ola 



a 0 0 
a a 0 
a a a 



The total benefit that the online can collect in this case is 
at most La, while the offline could have collected the whole 
third row, i.e., 1 -j- (L-h l)a. Therefore, the competitive ratio 
of the algorithm would be at least 



l+(L + l)a 
= 



1 + 



La ' 



We conclude that the competitive ratio of any determin- 
is tic algo rithm is at least min{r',r"}. Choosing a = (Z/ + 
VL^ + 4)/2, we get that a^ = 1*+ La and hence 

, „ 1 ^ a + 1 1.1.1 
r =r =1 + — — = i-f_ + _. 

La L aL 

Since a < L~h l/L, we get that min{r',r"} > 1 + ^ + 
as claimed. □ 
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This lower bound (slightly) improves over the 1+ j straight- 
forwzurd lower bound. As the latter bound is the competitive 
ratio threshold for randomized aJgorithms (see Section 7), 
this proves that for any fixed lookahead L, the thresholds of 
• deterministic and randomized algorithms are different. 

5.2 Lower bounds for lookahead one 

We prove two lower bound results for deterministic algo- 
rithms for the web server farm problem with one server and 
lookahead 1. The first result shows that no algorithm can 
achieve a competitive ratio better than 4 for this problem. 
This matches oiu* algorithm firom Section 4. 1 that achieves a 
competitive ratio 4. The second result shows that in the case 
where the number of sites is two, no algorithm can achieve 
a competitive ratio better than 3\/3/2 « 2.598. This is al- 
most tight with our algorithm firom Section 6.1 that achieves 
a competitive ratio roughly 2.618. The proofs of the follow- 
ing two theorems are omitted due to space constraints. 

Theorem 6. Consider the web server farm problem with 
one server and lookahead 1, For every c < 4, there exists an 
a > 0 such that the competitive ratio of any deterministic 
online algorithm for the problem with s sites is at least c. 

Theorem 7. Consider the web server farm problem with 
two site,s, one server and lookahead 1. For every c < 3\/3/2, 
the competitive ratio of any deterministic online algorithm 
for the problem is at least c. 



6. SPECIAL CASES 

In this section we consider two special cases of the web 
server farm problem. The first is the case of one server and 
two sites with lookahead one. The second is multiple servers 
with lookahead one. 

6.1 One server, two sites, and lookahead one 

We devise a deterministic online algorithm for the web 
server farm problem with one server, two sites and lookahead 
L = 1. The algorithm has competitive ratio r = l-f<^ where 
0 = ^'^^ « 1.618 is the golden ratio. This ratio is neao-ly 
the best possible in this case, as we present (in Section 5) a 
lower bound of s» 2.598 on the competitive ratio of any 
deterministic algorithm for this problem. We remark that 
the competitive ratio that we obtain in this case is much 
better than in the case of arbitrary number of sites, where 
the best possible competitive ratio is 4. 

The algorithm is based on discounting future benefits. It 
is similar to the algorithm presented in Section 4.1 (special- 
ized to the case L = 1), except that the discount factor is 
taken to be 0. The analysis for this case adds some compli- 
cation to that of the general case, in order to take advjmtage 
of the limited number of sites. For example, a discount fac- 
tor of 2 optimizes the analysis of Section 4.1, and then the 
competitive ratio is shown to be at most 4. The actu2Ll com- 
petitive ratio in this case is 3, which can be shown by an 
analysis similar to the one that we give here. 

The following theorem states the improved competitive 
ratio for this case. 

Theorem 8. The discounting algorithm with discount fac- 
tor 4> = ^-*2^ achieves competitive ratio r = 1 + ^ « 2.618 
for the web server farm problem with one server and two 
sites. 



Proof. We first describe the algorithm in more detail. 
We associate with each possible adlocation its anticipated 
benefit, which combines the immediate benefit of this allo- 
cation and its future benefit. To compensate for the uncer- 
tainty of a future benefit (the algorithm might later decide 
not to collect these benefits), we discount it (with respect to 
an immediate benefit) by a factor of <j>. In other words, an 
action that causes the algorithm to collect a benefit 6 and 
to have the potential to collect a benefit 6' in the future has 
auiticipated benefit of (/> • 6 -h 6' . 

Suppose that the situation of the online algorithm at some 
time i is as illustrated below. Assume, without loss of gen- 
erality, that the allocation of the server in the previous time 
unit is to the first site, as denoted by the triangle. Since the 
lookeihead is L = 1, the algorithm knows only the demands 
at the current time t and at the next time t -h 1. 

<*■.>=[:.:" n :::] 

The algorithm decides on the allocation of the server at time 
t as follows. If • X -h > J/' then the previous allocation is 
kept, i.e., the online algorithm allocates the server at time t 
to the first site. Otherwise, the algorithm changes the allo- 
cation and the server is allocated to the second site. Observe 
that this decision achieves the largest anticipated benefit for 
the algorithm. If it decides to stay with the allocation as 
before, it collects a benefit of x and will have the option 



546 



to collect at the next time < -f 1 an additional benefit x', 
yielding an anticipated benefit 0 • x -h x'. If the algorithm 
decides to change the allocation to the other site, then it 
collects no immediate benefit (at this point there is no way 
for the online algorithm to collect the benefit y), and in the 
next time < + 1 it will have the option to collect a benefit j/', 
yielding an anticipated benefit 4>'0-\- y' = y' . 
We will prove by induction on T that^ 

rY^ONi-^- . OJVf +1 > OFFt + OFFf+j (4) 

where ONt is the actual benefit collected at time * by the 
online algorithm; ON; is the potential benefit that the on- 
line algorithm will collect at time t if it allocates the server 
to the same site as at time t - 1] OFFt and OFF^* are de- 
fined similarly for an arbitrary ofiline algorithm; and qt = (f> 
if at time t - 1 the online and the offline algorithms have 
allocate the server to the same site (i.e., if they have the 
same "starting position" for time t), and = 1 otherwise. 
(Note that ONt is either ON! in case the potential benefit 
is gained or 0.) 

Proving (4) would complete the proof, since we can as- 
sume, without loss of generality, that all the demands in the 
first and last X + 1 time steps are zero, and thus obtain for 
the last time step T that r J2t ONt > J^t OFFt, as required. 

The base case of the induction follows from padding the 
demand matrix with zerps from both sides. For the inductive 
step, i.e., that the induction hypothesis (4) for time T follows 
from that of time T - 1, it suflSces to show that 

r • ONt + gr+i • OiVf +i - qr ■ ONt (5) 
> OFFt + OFFr+i - OFF^ 

We prove Equation (5) by verifying it over all cases. Denote 
the benefits at times T, r+ 1 by x, x', y, y' as above. Assume 
without loss of generality that at time T — 1 the server is 
allocated to the first site. The total number of cases is eight, 
since there are two possible decisions for the online, and four 
possible decisions for the offiine. 

Consider first the cases in which the allocation of the of- 
fline at times T and T -h 1 are the same. Then at time T 
the offline collects the benefit OFFt, and hence OFFr — 
OFFt. In addition ON^ = x, so inequality (5) simplifies to 

r . ONt 4- ^r+i • ONt+i - qT - x > OFFt^^. 

1. qT = <i>,qT-\-i = <f>' Then both offline and online collect 
X. We need to show that 

r-x-|-^-x'-^-x>x' 
i.e., that x + (<^ - l)x' > 0, which clearly holds. ^ 

2. qT = 1,^+1 = 1. Then offline collects y and online 
collects X. We need to show that 

r • X + x' - X > 

i.e., that </> • x + x' > j/', which holds since online keeps 
the previous allocation, y/ 

3. qT = <f>,qT+i = 1. Then offline collects x and onhne 
does not collect x. We need to show that 

rO + y- ^x>x' 

which holds since online changes allocation, y/ 

In a sense, this is a potential function analysis. See [9]. 



4. qT - l,7T+i = 0- Then offline collects y and online 
does not collect x. We need to show that 

rO'}-</>y'^x>y 

i.e., that y' > ^ . x = 0 • x, which holds since online 
changes allocation, y/ 

Consider now the cases in which the allocation of the of- 
fline at times T and T -h 1 are different. Then the benefit of 
the oflaine at time T is OFFt = 0. In addition OTVJ. = x, 
so inequality (5) simplifies to 

r • ONt + qT+i • ONt+i - qT - x > OFFt^i - OFF^ 

5. qT = <l>, qT+i = ^. Then offline and online both do not 
collect X. We need to show that 

rO-{-<l>-y-<f>'X>y-x 

i.e., that > x, which clearly holds since online changes 

allocation, 

6. qT = l,9r+i = 1. Then offline does not collect y and 
online does not collect x. We need to show that 

rOH-y'-x>x'-j/ 

and since online changes allocation we indeed get y' > 

0 • X -{- x' > X -f- x' — y. y/ 

7. qT = (p.qT+i = 1. Then offline does not collect x and 
online collects x. We need to show that 

r-x + x' — <^-x>y' — X 

i.e., that 2x -f x' > y', which holds since online keeps 
the previous allocation . 

8. qT = l,(7T+i = Then offline does not collect y and 
online collects x. We need to show that 

rx-f0x' — x>x'-y 

i.e., that 0 • x H- - l)x' + y > 0, which clearly holds. 

□ 

6.2 Multiple servers with lookahead one 

We present an online algorithm for the web server farm 
problem with multiple servers and lookahead one, as stated 
in the following theorem. We omit the proof of the lower 
bound for lack of space. 

Theorem 9. There is a deterministic algorithm for the 
web server farm problem that achieves competitive ratio 8 
when the number of servers k is even, and competitive ratio 
Cife = 2 + 2/(2A; - 1) when k > 3 is odd. Furthermore, no 
online algorithm achieves a lower competitive ratio. 

Proof. We distinguish between even and odd numbers 
of servers. For an even k, the algorithm splits the set of 
servers into two sets of equal size. The first set of servers 
is allocated so as to always collect the k/2 largest benefits 
on the odd time steps. These servers spend the even time 
steps in reallocation to those sites where they will be able 
to collect in the following time step, which is odd, the k/2 
largest benefits. On even time steps, these servers may col- 
lect zero benefit in the worst case. The second set of servers 
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BEST AVAILABLE COPY 



Workload 


Avail. 


Opt. 


Disc. L = l 


Disc. L = 4 


Reset. £ = 4' 




ran- 24 


9494 


7871 


7750 (98.5%) 


7803 (99.1%) 


7484 (95.1%) 


7753 (98.5%) 


coml-15 


4634 


3904 


3899 (99.9%) 


3901 (99.9%) 


3875 (99.3%) 


3894 (99.8%) 


coml-25 


4634 


4634 


4634 (100 %) 


4634 (100%) 


4629 (99.9%) 


4634 (100%) 


com2-15 


7265 


4227 


4217 (99.8%) 


4218 (99.8%) 


4163 (98.5%) 


4183 (99.0%) 


com2-25 


7265 


6013 


6007 (99.9%) 


6009 (99.9%) 


5966 (99.2%) 


5990 (99.6%) 



Table 2: Experimental results 



is scheduled in a symmetric way. It collects the k/2 largest 
benefits on the even time steps and spends the odd time 
steps on reallocation to the best sites for the following time 
step which is even. We conclude that at each time t, the 
total benefit that the k servers collect is at least as large as 
the k/2 largest benefits at this time t, which is clearly at 
least half the benefit that an optimal offline algorithm can 
collect at this time t. Therefore, the competitive ratio of 
this algorithm is at most 2. 

For an odd k, the algorithm splits the servers into three 
groups. Two groups are of size (A; - 1)/2 each, and the third 
group contains one server. The first group of servers collects 
the {k - l)/2 largest benefits on the odd time steps. The 
second group collects the (A: - l)/2 largest benefits on the 
even time steps. Let Ft denote the (A: + l)/2 largest benefit 
in step t. The last server tries to collect Ft on each time 
step, as follows. On the step t the server either collects Ft 
or it moves to the site q such that dg,t+i = P't-hi- The server 
collects Ft if 2Ft > Ft^i; otherwise the server moves to site 
If the server collects Ft on some step t, it sacrifices the 
step « + 1 on a move to the ^th best site q for step t H- 2, 
i.e., = Ft-f,2. 

For the sake of analysis we split time into phases: the 
first phase starts at time step 0 (by convention we assume 
that benefits at this time step are equal to zero). A phase 
ends at time step * + 1 if the server fi-om the third group 
collects Ft in step t. The next phase starts at step < + 2, and 
so on. Notice that the server from the third group always 
starts the phase allocated to the ^th best site. Consider 
the first phase. We claim that Ft > l/4X;*iJi^'. Indeed, 
2i^« < Pi-i-x for i = 1, . . . ,i - 1 and 2Ft > Therefore ' 

<+l t+l oo 

J2^'<^^Y1 V2*"* < 2Ft J2 1/2^ = 4Ft. 

For p<k, if 5(r,p) is a sum of the p largest benefits in step 
r, then Er*^(^P) >PlkY.rOFFr and 

> X;s(r,^)+F, 



8A: 
2A;- 1 



4A; 



OFF = cZ^OFF . 



The same proof holds for any phase, and therefore we 
have^proved that our algorithm is -competitive for odd 



7. RANDOMIZED ONLINE ALGORITHMS 

In this section we derive tight bounds on the competi- 
tive ratio of randomized online algorithms with lookahead 
L. These bounds are stated in the following theorem. We re- 
mark that our randomized algorithm requires only 0(log L) 
random bits, independent of the input length. 

Theorem 10. There is a randomized online algorithm 
for the benefit task system problem with competitive ratio 
1-hl/L. Furthermore, no randomized algorithm achieves a 
lower competitive ratio, even for the web server farm prob- 
lem with one server. 



Proof. The idea behind the algorithm is that with looka- 
head we can compute an optimal path for L time steps. 
However, we may have to make a poor move, in which we 
collect little or no benefit, in order to reach the first state 
on this path. After following this path, we can again choose 
the best state to move to during the next time step, possi- 
bly without collecting any benefit at all, but ensuring the 
best starting point for the next L steps. We refer to this as 
the "resetting" algorithm, since it operates in stages, after 
each of which it resets to an optimal state for the upcoming 
stage. The online algorithm will collect at least as much 
benefit as the optimal offline algorithm on every step except 
the resetting steps. Notice that there are Z> + 1 possible 
"phases": phase j means (potentially) giving up all benefit 
during steps t - (L + 1) + ^' for each i, where 0 < j < Since 
the sets of steps whose benefit is given up is these phases 
are mutually disjoint, the total benefit lost in all phases is 
at most the offline benefit. Randomizing over these choices, 
the online algorithm loses in expectation only a 1/(L -hi) 
firaction of the offline benefit, and the competitive ratio is at 
most jzrrTTLTT) = ^ + V^- 

For any e > 0, we show a lower bound of (L -h 1)/{L -h 
c) for the one-server problem. Our randomized adversary 
generates a demand matrix {di^t} with fl/e] rows and Z + 2 
columns (which can be repeated indefinitely), and a benefit 
matrix of all ones. The first column of the demand matrix 
contains all zeroes, the next L columns contain all ones, 
and exactly one randomly chosen row in the last column 
contains a one and the rest contain zeroes. The optimal 
choice is for the server to move during the first step to the 
row containing the one in column L + 2, and to stay in 
that row for L + 1 steps, collecting a total benefit of L 4- 1. 
Any randomized online strategy stands only an e chance of 
choosing this row, and with probability at least 1 - e must 
either miss the benefit in this column, or forgo benefit in 
some earlier column in order to collect it. Thus the expected 
benefit collected by the randomized online algorithm is at 
most L H- e. □ 
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8. EXPERIMENTAL RESULTS 

We have implemented the offline optimal, discount, and 
intermittent reset algorithms for the web server farm prob- 
lem and measured their performance. We used a randomly 
generated input, and inputs generated from commercial re- 
tail web server logs. The online algorithms perform very well 
on these inputs, both in cases of overloaded and underloaded 
servers. In fact, the discount algorithm with lookahead one 
and the intermittent reset algorithm with lookahead eight 
match the optimal offline performance in the case of under- 
loaded servers. (This is not surprising, since it can be shown 
that if the optimal offline algorithm is able to collect all the 
available benefit, then the greedy algorithm with lookahead 
one achieves this optimal benefit.) The discount algorithm 
performs better, with less lookahead, than the intermittent 
reset algorithm, and is within 0.2% of optimal on all the 
commercial trace instances, even with lookahead one. We 
use the (untuned) value of 2 as the discount factor. 

The random problem instance has 3 customers and 400 
time steps. Demands are generated uniformly between 0 
and 16, and revenues uniform between 0 and 2. 

The commercial traces were collected from two different 
retail sites, which will remain unnamed due to confidential- 
ity constraints. One of these is separated into "browse" and 
"buy" transactions. We. treat these separately (i.e., so that 
we have three different "customers"), and assign revenue 
three times as great to the "buy" transactions since they 
represent actual monetary transactions. Each trace repre- 
sents the requests arriving at a web server over a 24- hour 
period. We separate these into 288 five- minute bucketis, i.e., 
our time unit is five minutes. We scale the inputs using re- 
alistic server rates, which results in demands ranging in the 
low tens of servers for two customers, and up to two servers 
for the third (the one that handles the "buy" transactions). 

Table 2 shows the algorithms' performance. The column 
labeled "Avail." denotes the totaJ available revenue for the 
problem instance, i.e., the revenue that would be collected 
with an infinite number of servers. The next column, labeled 
"Opt" , shows the revenue collected by the optimal offline al- 
gorithm. The next four columns show the revenue collected 
by the discount algorithm and the intermittent reset aJgo- 
rithm with differing amounts of lookahead (one and four for 
the discount algorithm, and four and eight for the intermit- 
tent reset algorithm). All values are rounded to the nearest 
integer. The "Workload" column indicates the trace used 
(random and the two commercial traces, coml and com2) 
and the nimiber of servers. 
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